NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Robust First- and Second-Order Differentiation for Regularized Optimal Transport

https://doi.org/10.1137/24M1674030

Li, Xingjie; Lu, Fei; Tao, Molei; Ye, Felix X-F (June 2025, SIAM Journal on Scientific Computing)

Applications such as unbalanced and fully shuffled regression can be approached by optimizing regularized optimal transport (OT) distances, including the entropic OT and Sinkhorn distances. A common approach for this optimization is to use a first-order optimizer, which requires the gradient of the OT distance. For faster convergence, one might also resort to a second-order optimizer, which additionally requires the Hessian. The computations of these derivatives are crucial for efficient and accurate optimization. However, they present significant challenges in terms of memory consumption and numerical instability, especially for large datasets and small regularization strengths. We circumvent these issues by analytically computing the gradients for OT distances and the Hessian for the entropic OT distance, which was not previously used due to intricate tensorwise calculations and the complex dependency on parameters within the bi-level loss function. Through analytical derivation and spectral analysis, we identify and resolve the numerical instability caused by the singularity and ill-posedness of a key linear system. Consequently, we achieve scalable and stable computation of the Hessian, enabling the implementation of the stochastic gradient descent (SGD)-Newton methods. Tests on shuffled regression examples demonstrate that the second stage of the SGD-Newton method converges orders of magnitude faster than the gradient descent-only method while achieving significantly more accurate parameter estimations.
more » « less
Free, publicly-accessible full text available June 30, 2026
Challenges in Zero-Voltage-Switched-on Multi-MHz Multi-kW SiC Full-Bridge Inverter and Oriented Design for High-Power Capacitive Power Transfer System

https://doi.org/10.1109/JESTPE.2024.3525259

Wang, Yao; Zhao, Shuyan; Lu, Fei; Zhang, Hua (January 2025, IEEE Journal of Emerging and Selected Topics in Power Electronics)

Full Text Available
Nonlocal Attention Operator: Materializing Hidden Knowledge Towards Interpretable Physics Discovery

Yu, Yue; Liu, Ning; Lu, Fei; Gao, Tian; Jafarzadeh, Siavash; Silling, Stewart (December 2024, 38th Conference on Neural Information Processing Systems (NeurIPS 2024))

Despite the recent popularity of attention-based neural architectures in core AI fields like natural language processing (NLP) and computer vision (CV), their potential in modeling complex physical systems remains underexplored. Learning problems in physical systems are often characterized as discovering operators that map between function spaces based on a few instances of function pairs. This task frequently presents a severely ill-posed PDE inverse problem. In this work, we propose a novel neural operator architecture based on the attention mechanism, which we refer to as the Nonlocal Attention Operator (NAO), and explore its capability in developing a foundation physical model. In particular, we show that the attention mechanism is equivalent to a double integral operator that enables nonlocal interactions among spatial tokens, with a data-dependent kernel characterizing the inverse mapping from data to the hidden parameter field of the underlying operator. As such, the attention mechanism extracts global prior information from training data generated by multiple systems, and suggests the exploratory space in the form of a nonlinear kernel map. Consequently, NAO can address ill-posedness and rank deficiency in inverse PDE problems by encoding regularization and achieving generalizability. We empirically demonstrate the advantages of NAO over baseline neural models in terms of generalizability to unseen data resolutions and system states. Our work not only suggests a novel neural operator architecture for learning interpretable foundation models of physical systems, but also offers a new perspective towards understanding the attention mechanism. Our code and data accompanying this paper are available at https://github.com/fishmoon1234/NAO.
more » « less
Full Text Available
Gallium Nitride (GaN) based High-Power Multilevel H-Bridge Inverter for Wireless Power Transfer of Electric Vehicles

https://doi.org/10.1109/ITEC60657.2024.10598884

Chevinly, Javad; Rad, Shervin Salehi; Nadi, Elias; Proca, Bogdan; Wolgemuth, John; Calabro, Anthony; Zhang, Hua; Lu, Fei (June 2024, IEEE)

Full Text Available
Power Loss Investigation of Pavement Materials in Roadway Inductive Charging System

https://doi.org/10.1109/APEC48139.2024.10509424

Zheng, Zilong; Wang, Yao; Chen, Xiao; Zhao, Shuyan; Rad, Shervin Salehi; Zhang, Hua; Wang, Hao; Lu, Fei (February 2024, IEEE)

Full Text Available
Nonparametric Learning of Kernels in Nonlocal Operators

https://doi.org/10.1007/s42102-023-00105-9

Lu, Fei; An, Qingci; Yu, Yue (August 2023, Journal of Peridynamics and Nonlocal Modeling)

Nonlocal operators with integral kernels have become a popular tool for designing solution maps between function spaces, due to their efficiency in representing long-range dependence and the attractive feature of being resolution-invariant. In this work, we provide a rigorous identifiability analysis and convergence study for learning kernels in nonlocal operators. It is found that kernel estimation is an ill-posed or even ill-defined inverse problem, leading to divergent estimators in the presence of modeling errors or measurement noises. To resolve this issue, we propose a nonparametric regression algorithm with a novel data-adaptive RKHS Tikhonov regularization method based on the function space of identifiability. The method yields a noisy-robust convergent estimator of the kernel as the data resolution refines, on both synthetic and real-world datasets. In particular, the method successfully learns a homogenized model for stress wave propagation in a heterogeneous solid, revealing the unknown governing laws from real-world data at the microscale. Our regularization method outperforms baseline methods in robustness, generalizability, and accuracy.
more » « less
Full Text Available
High-Frequency Power Loss Investigation of Pavement Materials in Roadway Inductive Charging System

https://doi.org/10.1109/TTE.2025.3528227

Zheng, Zilong; Chen, Xiao; Wang, Yao; Zhang, Hua; Wang, Hao; Lu, Fei (June 2025, IEEE Transactions on Transportation Electrification)
Enhanced Axial Misalignment Tolerance in a 10-kW Autonomous Underwater Vehicle Wireless Charging System Utilizing a Split Solenoid Coupler

https://doi.org/10.1109/TPEL.2024.3431285

Mostafa, Amr; Wang, Yao; Lu, Fei; Zhang, Hua (October 2024, IEEE Transactions on Power Electronics)
NySALT: Nyström-type inference-based schemes adaptive to large time-stepping

https://doi.org/10.1016/j.jcp.2023.111952

Li, Xingjie; Lu, Fei; Tao, Molei; Ye, Felix X.-F. (March 2023, Journal of Computational Physics)

Full Text Available
Unsupervised learning of observation functions in state space models by nonparametric moment methods

https://doi.org/10.3934/fods.2023002

An, Qingci; Kevrekidis, Yannis; Lu, Fei; Maggioni, Mauro (January 2023, Foundations of Data Science)

Full Text Available

« Prev Next »

Search for: All records